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Abstract 

By revisiting the Kirchhoff's Matrix- Tree Theorem, we give an exact formula for the number 
of spanning trees of a graph in terms of the quantum relative entropy between the maximally 
mixed state and another state specifically obtained from the graph. We use properties of the 
quantum relative entropy to prove tight bounds for the number of spanning trees in terms of 



Q^, basic parameters like degrees and number of vertices. 

a '. 1 Introduction 

CT'' 

The Kirchhoff's Matrix-Tree Theorem is a classic result stating that the total number of spanning 
trees of a graph is exactly the determinant of any principal minor of the Laplacian matrix of the 
graph. The theorem can be traced back to 1847 [8]. The number of spanning trees of a graph, 
OO ■ which is also called the complexity of the graph, is a valuable invariant (i.e., a quantity that does 

, not depend on the ordering of the vertices). There are various and diverse applications of the 

I number of spanning trees: an important role in the theory of electrical networks, for example, in 

computing driving point resistances [4|; a wide use in the study of graph-theoretic problems, like 
. routing, counting Eulerian tours, etc. [15]; the computation of network reliability [5]; and chemical 

modeling [6j. In quantum field theory, it is well-known that the value of a Feynman integral can 
be written in terms of spanning trees |TT]. The Tutte polynomial, object with a variety of uses in 
statistical mechanics, can be characterized via spanning trees |19j . 
^ I By reinterpreting the Kirchhoff's Matrix- Tree Theorem in the context of quantum informa- 

tion theory, we give an exact formula to count spanning trees based on the notion of quantum 
relative entropy. This function is the quantum mechanical analog of the relative entropy. It is 
central in the quantification and manipulation of quantum entanglement, quantum data compres- 
sion, communication costs, etc. \14:\ 117). It is traditionally interpreted as a parameter to quantify 
the distinguishability between two quantum states. We show that the number of spanning trees 
is proportional to the distinguishability/distance between a certain density matrix associated to 
the graph in context and the maximally mixed state, i.e., the state with maximum von Neumann 
entropy, or, equivalently, maximum amount of classical uncertainty. 
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By using standard machinery from quantum information theory, we study bounds on the number 
of spanning trees obtained from basic quantities Uke the number of vertices, edges, and degrees. We 
exhibit a tight bound; equahty is obtained for stars and certain multigraphs obtained by adding 
multiple edges to stars. Even if the bound is tight, it performs poorly in general. A potential 
improvement could be obtained by allowing different coefficients in the quantum state associated 
to the graph. 

The remainder of the paper is structured as follows: Section 2 contains the mathematical setup 
and the main result; Section 3 gives lower and upper bounds; Section 4 proposes a brief discussion 
and draws some conclusions. We give particular attention to a plausible operational meaning for 
the number of spanning trees, when considering our class of quantum states. 

2 Main result 

A graph is an ordered pair G = {V,E), where 1^ is a set of elements called vertices and E C 
V X V — {{i,i}, for every i G V} is set of unordered pairs of vertices called edges. Despite we 
mainly consider simple graphs, our treatment can be easily generalized to multigraphs, i.e., graph 
with multiple edges. A graph H = (W, F) is a subgraph of G if W (^V and F <^ E. The subgraph 
H is spanning if W = V. A cycle is a graph with set of vertices {0, 1, k — 1} and set of edges 
{{i, (i + 1) mod k} : i = 0,1, k — 1}. A tree is a graph without cycles as subgraphs. Denoting 
by \S\ the number of elements in a set S, the degree of a vertex i is the nonnegative integer 
d{{) = \{j : G E^\. The [combinatorial) Laplacian of G is the n x n matrix L = A — A, 

where Aij = d{i) 6ij {5ij is the Kronecker delta); Aij = 1 if {i,j} G E and Aij = 0, otherwise. The 
determinant of the n — 1 x n — 1 matrix L'^ obtained by deleting the i-th. row and column of L 
is independent of i. The Kirchhoff 's Matrix-Tree Theorem (see, e.g., |16j . Theorem VI.29) states 
that for the number of spanning trees of a graph G, t{G), we have 



Let Ti be an Hilbert space of dimension n with standard basis {|1), where the vector \i) 
is associated to the vertex i € V. The volume of G is denoted and defined as vol(G) = J2iev (0- 
By the Handshake Lemma, vol(G) = 2\E\. Denoting by {i\ the functional that sends \j) to the 
inner product {i\j), for each i,j = 1, ...,n, we define the matrix 



where \i — j) := \i) — It is promptly verified that p = L/Tr(A) = L/vol(G) [3]. In what follows, 
we write d = vol(G), in order to simplify the notation. We can treat p as the state of a quantum 
system with assigned Hilbert space H, given that p is a density matrix, being positive-semidefinite, 
and trace-one. In particular, we shall use two notions from the tool-box of quantum information 
theory: CPTP maps and the quantum relative entropy. 

Axiomatically, the evolution of a state a may be governed by a completely positive trace pre- 
serving (for short, CPTP) map ^> — ?> $ (cr) = Ylj -^j^-^j ^ some set {Kj} of operators on T-L 
such that KjKj = I, where / is the identity matrix (see, e.g., [12j). The von Neumann entropy 
of a density matrix a is defined by S (a) = —Tr(alna). Given density matrices ui and a2, the 




(1) 




(2) 
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quantum relative entropy of ai with respect to (T2 measures the difficulty of distinguishing between 
these states and it is defined by 

•SCo"! ||<T2) = Tr (cTi lno"i) — Tr (cri lna2) ■ 

The Kirchhoff's Matrix- Tree Theorem tells us that the number of spanning trees of a graph 
Gon n vertices, t(G), can be expressed in terms of the density matrix p. From Eqs. ([T]) and ([2]), 
we are prompted to the next statement: 

Theorem 1 Let G be a graph on n vertices and d/2 edges. The number of spanning trees of G is 

^(G)=(^)""'e-('^-i)^(Slt*^('')), (3) 

and also 

r(G) = (^)""'e-^EL.^(;ail*.(p)). (4) 

Here, Hi is a projection operator onto the n — 1 dimensional space spanned by all the elements of 
the standard basis associated to G, but the £-th; is the CPTP map defined by 

= litplit + QtpQi, (5) 

where 

Qi = i-iii = \e){i\ , 

is the complementary projector of Hi. 

(In matrix theory, the above transformation is called pinching and consists of removing all the 
off-diagonal terms related to the i-th vertex of the graph.) The derivation of Eq. ([3]) is simple and 
ultimately follows from the identity 

exp{-S{I/n\\M)) = iV(detM)i/", 

which holds for all positive semidefinite nxn matrices M. For a more explicit derivation, we firstly 
construct the nxn matrix which has zeros in the i-th row and in the ^-th column, but it is identical 
to the Laplacian L in the remaining entries. We can express it as 

UiLUi = dUipUi. (6) 

By construction, the matrix UiLHi will have the same spectrum of L'^ plus an extra zero eigenvalue. 
Recall in fact that is an (n — 1) x (n — 1) matrix, while HiLHe is n x n. Such an extra zero 
eigenvalue is associated with the eigenvector \£). Therefore, denoting by the eigenvalues of L'^, 
we can write 

r(G) = det (4) = n (7) 

ie<s' 

where the set S' contains all eigenvalues of HiLUi but the extra zero. This formula implies 

Inr(G) = ^InA', = Tr{Uiln{UiLUi)). 

ies' 
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(We adopt the standard convention OlnO = 0.) Thus, by Eqs. (j5l6p . 

In t(G) = {n- l)Tr(^ \u{dli,pli,)) 

= {n- l)Tr(;2^ Hll^pllt)) + (n - 1) Ind 
= (n - l)Tr(^ In $,(p)) + (n - 1) hi d, 

where in the last identity we used the fact that since Qi and 11^ are orthogonal projectors we have 

\n{liiplii + QipQi) = ln{UipUe)Ui + ln{QepQe)Qe. 

Noticing that n^/(n — 1) is positive semi-definite and has trace one, we can identify this operator 
with a density matrix. Indeed, it is the density matrix describing a maximally mixed state of the 
subspace orthogonal to It follows that 

Tr(^lncl>,(p)) = - S{^JUp)) 

= -Hn-l)-S{^,\\^,{p)). 

H6I1C6 

l^riG) = -in-l)S{^JMp)) +(n-l)ln^, 

which reduces to Eq. ([3]) by exponentiating it - Eq. is then a trivial consequence of Eq. 

If the graph does not have isolated vertices (i.e. if A is invertible, or d{i) > for every i (^V) 
an alternative (but equivalent) expression can be obtained as follows: 

-n5(|||$,{p)) 

r{G) = {ir'-^^ (8) 

" det(A)i/n ' 

where, for i G {1, • • • ,n}, is the ^-th diagonal element of the degree matrix A. If A is not 
invertible, i.e., if it has at least a zero diagonal entry, the graph G has no spanning tree. In this 
case, one can verify that Eq. ([8]) still applies under the assumption that i is not an isolated vertex 
of the graph. On the other hand, Eq. ([9]) cannot be used, as the denominator diverges. To include 
this case, we should simply say that if det A = then t(G) = 0, while if det A 7^ then Eq. ([9]) 
applies. To prove the above identity, it is sufficient to verify Eq. ([8]), since Eq. Q is obtained by 
multiplying such term over all possible values of i and by taking n-th root of the result. For Eq. ([8]), 
we go back to Eq. ([7]) and notice that 

InriG) = TrimnidUipUi)) 

= TT{Ue ln{UepUi)) + (n - 1) In d 

= Tr(ln <^i{p)) - ln{Ae/d) + (n - 1) In d 

= Tr(ln <^eip)) - ln(Af ) + nlnd, (10) 



where we used the fact that 



ln<l>i{p) = n^ln(n^pn^) + QeMQ^pQ^) 
= Ueln{UepUe) + Qeln{Ae/d). 
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Eq. ([8|) finally follows from 



TV(In$^(p)) =nTr(^ln$^(p)) = -nS{^\\^e{p)) - nlnn. 

Eqs. (|3l4l8|9p allow us to express t(G) in terms of the density matrix p and the relative entropy. 
It is worth noticing that it is possible to recast these equations in the following form: 

now is the density matrix obtained by normalizing L^, i.e., 

where ~ denotes equivalence. (Notice that the last term is an n x n matrix with a zero column 
and a zero row.) The derivation of this expression is exactly on the same line as the previous ones. 



3 Bounds 

We can derive lower and upper bounds on t(G) by exploiting known facts about the quantum 
relative entropy. On the other hand, deriving meaningful, simple lower bounds on t(G) with our 
technique does not seem to be natural. A first attempt based on the monotonicity of the relative 
entropy fails to produce even a trivial result, which is t(G) ^ 0. We discuss this case here only 
as an exercise. Firstly, observe that is a unital quantum channel {i.e., it maps the identity 

operator into itself ^£{I) = I) and that ii) the relative entropy is not increasing under CPTP maps, 
an important and nontrivial property: 

sip4p2)>smpi)mp2)). 

Using these two facts we can exhibit a lower bound for t(G): 

n 

= S{Ui)\mp)) ^ S{l\\p) = -Inn- i J^lnA, (11) 

i=i 

where A,- is the j-th eigenvalues of p. Replacing this into Eq. 1^, we thus get 



^ ' Api^ At 
_ (i"det(p) _ det(L) 

which is true for all I. Therefore taking the minimum over i, we can write 



(12) 



r(G) > (13) 

where bmin denotes the minimum degree of the graph. Similarly, replacing Eq. pip into Eq. Q, 
we get the inequality t(G) ^ det(L)/(det A)^/*^. Notice that these lower bounds are both trivial. 
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Indeed, since L has at least a null eigenvalue we have that det(L) = and thus the above inequalities 
simply states that t{G) ^ 0. It is worth observing that one could in principle get something less 
trivial by replacing the density p which appears in the r.h.s. of Eq. (jlip with a generic p state 
which satisfies the condition ^i{p) = ^i{p) (notice that such p can be easily constructed and that 
they have the property of having the same diagonal elements of p) . This would allow us to rewrite 
Eq. (flSl) as t{G) ^ det{L) / Smin-, where L = pd (in this case det(Z) needs not to be zero). 

Using the results in [1], we can determine upper bound on 5'(-||-) which, through our expressions, 
will result on bounds on t{G). None of the bounds seem to be particularly relevant (they are too 
weak and require to compute quantities which are not easily computable). As an example we just 
report here one of them which is obtained through the inequality 

5^(^111^2) ^ 



^min{P2) 

where \min{p2) is the smallest nonzero eigenvalues of p2- We apply this inequality to the identity 
of Eq. dH]). In this case, pi = I /n and p2 = ^i{p)- Hence, exploiting the properties of p, 



The negative fact is that we end up to compute the smallest eigenvalue of ^i{p) and this is only 
slightly less time demanding than computing the determinant (and of course the determinant of 
^e{p) is simply A<.r(G)). 

In the following, we derive several upper bounds for t{G). Before entering into the details of 
the derivation, we summarize the most relevant bounds: 

Theorem 2 Let G be a graph on n vertices and Then 

t{G) ^ min{To, ta, • • • , te}, 

where 

^0 = -^tr^ = det(A) / , 



TB - ir(A/nj , TC - 

_ Tr(A/n)" _ / Tr(A)^An,ax 



n-1 



The hound tq is tight in the sense that for each value of tq we can construct a graph H with an 
arbitrary number of vertices n ^ 2 such that 

t{H) = TO (H) . 

As a remark, it is worth observing that we can verify the following inequalities hold: 

To ^ TA ^ TB ^ TC, 
To ^TD ^ TB, 
To^TE TB- 

Also, there is not a definite ordering between tjj and ta- A further upper bound, worse than ta, is 

1— l/n 



rF=(T^^ det(A) 
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All the bounds in the theorem should be compared with the trivial value r^. 



rivial 



Tr(A/2) 
n — 1 



which simply follows by observing that any spanning tree has exactly n — 1 edges and that Tr(A/2) 
is the total number of edges of the graph. It turns out that there is not a definitive ordering among 
Ttriviai and the bounds that we derived above. Indeed, our bounds perform better on some graphs 
only. 

As an example, let us consider the following cases: 

• The complete graph, Kn- According to the Cayley formula, T{Kn) = n"~^. For K^, we have 
A = (n — 1)/, Ajiiax = n — 1, Tr(A) = n{n — 1), and det A = (n — 1)". Hence, all our bounds 

^ _ ^n-i ^ . . . _ f ^i^- l)/2 

in— 1 



coincide, i.e., tq = ta = tb = tc = td = te = {n - 1)*^ \ while nriviai , n-1 
which for n ^ 7 is already larger than 



The star, Ki^n-i- In this case, A has one eigenvalue equal to n — 1 (in fact, Amax = n — 1) and 
n — 1 eigenvalues equal to 1. Clearly, r(Ki^„_i) = 1. We have that tq = te = 1, a tight bound, 

while = (n - iy^'\ rs = [^J'", rc = rn = 

Notice that in this case ta < T£) for n ^ 6 and ta > td for n < 6. 



n / n—1 ' 



• Consider the graph with n = 4 vertices and edges {1, 2}, {2, 3}, {3, 4}, {2, 4}. One can easily 
verify that t{G) = 3. For this graph, we have tq = Ttriviai = 4, while ta — 6.45, tb = 8, 
TC = 8.6, T£) ~ 5.33, and te — 4.62. Notice that in this case ta > td. 

To prove the optimality of tq, we firstly observe that each graph G with n = 2 and multiple 
edges saturate the bound. Those graphs are only characterized by the number k of edges which 
connects the two element of V: therefore A = diag(A;, fc) and to(G) = k. For multigraphs on an 
arbitrary number of vertices n > 2, we consider stars with multiple edges only between an arbitrary 
but fixed pair of adjacent vertices. W.l.o.g. we may assume k edges between vertices 1 and 2 only. 
These multigraphs are denoted by i^i,n,fc- It is clear that T{Ki^n,k) = k. Furthermore, since A = 
diag(/c + (n — 1), /c, 1, • • • ,1), then also ro(-fCn,i,fc) = k. 

Both these bounds follow from the Klein inequality: the relative entropy of two states is always 
positive semi-definite. On the basis of Eq. we can conclude that 

n"A£ Ai 

for all £. Minimizing the r.h.s. with respect to such index, we can then write 

,n Trf-l" 

^(G) ^ = = rn, (15) 

Similarly, from Eq. Q, 

Tr(-)" 

^(^) ^ n«det(A)i/- = det(A)V" = ^^^^ 
Since A^ax ^ det(A)^/" ^ ^min, if follows that td ^ tc- 
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A more refined bound can be obtained by exploiting the following inequality (see [10]): 
^PiS{ri\\si) ^ Si^PiriW^qjSj) - ^Pilnipi/qi); 

i i j i 

this is valid if rj, Si are density matrices and pi and qi are generic probability distributions. Let us 
apply this to Y^i^i S{I /n\\^eip)), with pi = qi = 1/n: 

I E SiilW'^dp)) ^ ^(lll E ^^(^)/^) - ( V^) Hn/n) = S{l\\a). 

t e 

where 

cT = V ^i{p)/n = y^iUipUe + QipQi)/n = ^ - (1^) 



_ L 2 A 

d n d 

With the use of Eq. Q , we finally can write 



r(G) < ^" ^-nSih\a) ^ ddetja) 
n"det(A)V" det(A)V« 

_ det(A - ^A) _ det(L + 2A/n) 
det(A)i/n ~ det(Ai)/" 



(18) 



The bound is interesting but it still involves the computation of a determinant. We can however 
do better by using again the fact that the relative entropy is decreasing under the action of CPTP 
maps. Consider the CPTP map 



* = -E^^; (19) 



Now, 



n 

1=1 

n — 2 

^(A) = A, ^(A) = A. 

n 



d \ n j d 



where represent the CPTP map obtained by concatenating k times the map ^ {i.e. ^ is applied 
to £7 exactly k times). For ^ 1 this yields (in any norm), 

lim ^!^{a) = ^. 
fc— >-oo d 



Therefore, 



A fn-2\''+^A 



Siih) ^ S{^^{i)\ma)) = S{i\\^'{a)) = saw- - [-^ ) 
which in the limit of large k gives 
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Replacing this into the first line of Eq. (|18p . we finally obtain 

n"det(A)V" 

This upper bound, as the one of Eq. (I16p . is just a function of the matrix A. It is natural to ask 
which of the two bounds is better: the bound ([20}) is always better than (fT6]) . To prove this, take 



- = ^ 1, (21) 

TC Tr(A)n - ' ^ > 

which can be easily verified by exploiting the fact that ln(x) is a concave function of its argument. 
This clarify the relation between ta and tc- 

For the bound tb, we start form observing that Eq. ^ and ^ imply that the following 
inequality should apply for any t. 

A, . de.(A)V < '^'^T . 

where the last step is (again) a consequence of the Klein inequality. Now, 

ndet(A)V" (,-Trt=iS{i^\\^t{p)) ^ 



-E"=i5(^ll*,(p)) det(A)i/n Tr(A)- 

Recalling that Tr(A) = vol(G) and replacing this into Eq. ([9]), we have 

t{G) ^ Tr(A)-i = rB. 
We can now compare tb to and tq- Begin by writing 

rB_/ det(A) y^"_/r^y/" 
TC \tc) 



Thanks to Eq. (|2ip . we can thus conclude that 

TC Tc 

which implies tc ^ tb ^ ta- 

For To, we start from Eq. ^ which we specify for A^ = Amaxi ^-c, 



V"'/ '-»max 

where £ is the value of I achieving Amax- We then use the monotonicity of the relative entropy to 
write 



9 



where ^ is the unital CPTP map introduced in Eq. (jl9p . In writing the last passage of the first line 
we used the fact that since for each i and i' the channels <I>^ and commute, also ^ (and thus 
'^^) commute with The first identity in the second line is obtained for large k using the fact 
that ^^{p) — A/vol(G), and finally the last passage follows from the fact that for £, ^^(A) = A. 
Replacing this into Eq. (j22p . we finally get 

dy e-"^(nll^) _ f^Y »^"det(f ) _ det(A) 



V"-/ ^max V"/ ^max ^max 

To clarify the relations with the other bounds, we need only to observe that 

TO _ det(A)V" ^ ^ 

TA Ajjiax 

To derive te we use Eq. ([3]) and the Klein inequality, as we have already done above. This yields 

n-1 n, „ /, , sn-l ^Tr(A)-A, 



n— 1 



for all L The case A^ = Amax gives te] te > tq follows from the concavity of log(x); te < tb 
follows from A^ax 

^Tr(A)/n. 

By repeating the derivation of ta, starting from Eq. ([3|) instead of Eq. ([8]), we can derive a new 
upper bound, Tp- However, this turns out to be weaker than ta- Specifically, we get the following 
inequality: 



n 



n-1 



t{G) ^ (^;^J det(A)i-i/n = (23) 

To verify this, we use the joint convexity of the relative entropy and then its monotonicity under 
CPTP maps: 

i=l 1=1 i=l 

= S{I/n\\^{p))^S{^\l/n)\\^^+\p)) 

= 5(//n||'f^+i(p)) ~ S(//n||A/d), 



where ^ is the unital CPTP map introduced in Eq. (jl9p and where the last identity holds for 
k — )• oo. Replacing this inequality into Eq. ([3|), we finally obtain 

r{G) ^ ( ^e-^W-ll^/'^)V"' = ( ^ndet(A/d)i/"' 
\n — 1 J \n — 1 

which coincides with (1231). 



4 Conclusions 

We comment on the expressions that we have derived to see if these could help us in providing an 
operational meaning to t{G) or more generally to p. We notice that all the expressions derived so 
far allow us to write t(G) as a product of two quantities: a first term which is typically greater 
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than one and which depends only on the degree matrix of the graph; an exponential which is always 
smaller than one and which depends on the relative entropy between a density matrix obtained via 
some processing of p and a totally mixed density operator. In particular, we may point out that, 
apart from the cases of ([8]) and ([9]) where there is an extra factor inversely proportional to A^, the 
first contribution goes like ~ (d/n)", where d is the sum of the degrees. Since the quantity d/n 
measures the average degree, {d/n)"" counts the average number of independent walks one would 
obtain while hopping randomly n — 1 times along the edges of the graph while starting from a 
generic vertex. This number is clearly much larger than the number of spanning tree of G (the 
latter being just a proper subset of the trajectories generated by the hopping). The exponential 
term of our expressions can then be interpreted as the fraction of the trajectories which indeed 
correspond to a spanning tree of G. Their values are obtained by computing the relative entropy 
between a density matrix associated with G and a totally mixed state (either n dimensional or 
n — 1 dimensional). 

Invoking the quantum Stein's Lemma we can provide these quantities with an operational 
meaning in the context of the (asymmetric) quantum hypothesis testing problem (see [2]). It is 
known that S{pi\\p2) represents the optimal upper bound on type-II error rate for any sequence of 
measurements used to decide whether a given state is pi or p2, under the conditions of bounded 
type-I error. Let us recall that in the hypothesis testing problem, type-II (or false negative) errors 
are those in which we have mistaken p2 with pi; for type-I (or false positive) errors are those in 
which we have mistaken pi with p2. This implies that the exponential quantities in our expressions 
quantify how different is the quantum state associated to our graph from the totally mixed state. 

The study of the relation between relative entropy and the complexity of a graph which we have 
carried on in this paper is probably not exhaustive. It is plausible to believe that allowing freedom 
to add arbitrary coefficients to the projectors in the definition of p we can push the relative entropy 
closer to the effective number of spanning trees or even to a different graph-theoretic quantity. The 
associated problem would then closely resemble scenarios where we aim to optimize some quantity 
over a matrix fitting a graph under specific constraints (see, e.g., the Lovasz theta function, the 
Colin de Verdiere parameter, etc.). This appears to be an open research direction involving the 
relative entropy and possibly other standard notions from quantum information theory. 
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